Dynamic filter processing

ABSTRACT

Described are methods, systems and computer readable media for dynamic filter operations.

This application claims the benefit of U.S. Provisional Application No. 62/161,813, entitled “Computer Data System” and filed on May 14, 2015, which is incorporated herein by reference in its entirety.

Embodiments relate generally to computer data systems, and more particularly, to methods, systems and computer readable media for providing a dynamic data filter.

Filtering clauses can be used to narrow a larger data source into a focused subset of the larger data source based on one or more filtering criteria. For example, traditional Structured Query Language provides a “where” clause for filtering. In a system that has rapidly changing data sources, filtering is additionally complicated by the rapidly changing nature of the data sources. Filtering clauses can contain one or more filtering criteria that can be a single expression or a list of expressions kept in a separate table, file, or other data structure. This method for filtering with a list of expressions creates a static two step process of first retrieving the one or more filtering criteria from a list and then second, filtering a target data table by retrieving all the rows of data from the table where the criteria in the list are a match. An incomplete or incorrect result set can be obtained when a change occurs in the filtering criteria list after step one has been performed but before step two can be completed because the operation performed in step two is unaware of the changes in the filtering criteria list. A table join operation can also be used to join a filtering criteria table with a data table frequently to ensure that a change in the filtering criteria table will be added to the result. Such frequent joins of large tables can be resource expensive.
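For illustration only, the stale-result problem of the static two step process can be sketched in Python as follows. The table contents, the function names fetch_criteria and filter_rows, and the quote values are hypothetical and are not taken from any particular implementation.

```python
# Hypothetical data table; the values are placeholders for illustration.
quotes = [
    {"Symbol": "AAPL", "Quote": 1.0},
    {"Symbol": "CMI", "Quote": 2.0},
    {"Symbol": "SPY", "Quote": 3.0},
]


def fetch_criteria(criteria_list):
    """Step one: take a snapshot of the filtering criteria list."""
    return set(criteria_list)


def filter_rows(rows, criteria_snapshot):
    """Step two: keep only rows whose symbol is in the snapshot."""
    return [row for row in rows if row["Symbol"] in criteria_snapshot]


criteria_list = ["AAPL", "SPY"]

# Step one runs against the list as it exists at this moment.
snapshot = fetch_criteria(criteria_list)

# The criteria list changes after step one but before step two.
criteria_list.append("CMI")

# Step two is unaware of the change, so the result omits CMI.
stale_result = filter_rows(quotes, snapshot)

# Repeating both steps picks up the change, at the cost of a full re-read.
fresh_result = filter_rows(quotes, fetch_criteria(criteria_list))

stale_symbols = sorted({row["Symbol"] for row in stale_result})
fresh_symbols = sorted({row["Symbol"] for row in fresh_result})
```

In this sketch the stale result lacks the rows for the symbol added between the two steps, which is the incomplete result set described above.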

Embodiments were conceived in light of the above mentioned needs, problems and/or limitations, among other things.

Some implementations can include a system for automatically updating data source objects, the system comprising one or more hardware processors and a computer readable data storage device coupled to the one or more hardware processors, the computer readable data storage device having stored thereon software instructions that, when executed by the one or more hardware processors, cause the one or more hardware processors to perform operations. The operations can include creating a first data source object in memory and mapping the first data source object to a first stored data. The operations can also include creating a second data source object in memory and mapping the second data source object to a second stored data. The operations can further include creating a third data source object in memory and mapping the third data source object to a first subset of the first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object. The operations can include creating a first listener for the third data source object in memory and listening with the first listener for one or more changes to the first data source object. The operations can also include making one or more changes to the first data source object. The operations can further include detecting by the first listener of one or more changes to the first data source object. The operations can include receiving a notification from the first listener of the change to the first data source object and then updating the mapping of the third data source object with the one or more changes to the first data source object.

The operations can further include creating a second listener for the second data source object in memory and listening with the second listener for one or more changes to the second data source object. The operations can include making one or more changes to the second data source object. The operations can also include detecting by the second listener of one or more changes to the second data source object. The operations can include receiving a notification of one or more changes to the second data source object and requesting a remapping of the third data source object to a second subset of first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object. The operations can further include updating the mapping of the third data source object to a subset of first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object.

In some implementations, the mapping the third data source object to a first subset of the first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object can include selecting a set of rows from the first stored data with one or more key values that are present in the second stored data.

In some implementations, the mapping the third data source object to a first subset of the first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object can include selecting a set of rows from the first stored data with one or more key values that are not present in the second stored data.

In some implementations, the operations can further include creating a second listener for the second data source object in memory and listening with the second listener for one or more changes to the second data source object. The operations can include making one or more changes to the second data source object. The operations can also include detecting by the second listener of one or more changes to the second data source object. The operations can further include receiving a notification of one or more changes to the second data source object and determining whether to request a remapping of the third data source object to a second subset of first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object, the determination based on whether the one or more changes to the second data source object effected an overall change in the second data source. The operations can include updating the mapping of the third data source object to a subset of first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object only if the one or more changes to the second data source object effected an overall change in the second data source.
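One possible way to decide whether a batch of changes effected an overall change in the second data source is sketched below in Python. The sketch assumes that only the distinct set of key values matters for filtering, and the names apply_batch, distinct_keys and needs_remap are hypothetical.

```python
from collections import Counter


def distinct_keys(counts):
    """Return the set of key values that occur at least once."""
    return {key for key, n in counts.items() if n > 0}


def apply_batch(counts, changes):
    """Apply an ordered batch of ("add" | "remove", key) changes to a copy."""
    new_counts = Counter(counts)
    for op, key in changes:
        if op == "add":
            new_counts[key] += 1
        elif op == "remove":
            if new_counts[key] > 0:
                new_counts[key] -= 1
        else:
            raise ValueError(f"unknown change type: {op}")
    return new_counts


def needs_remap(counts, changes):
    """Return the updated counts and whether the distinct key set changed.

    A remapping of the filtered result is only worth requesting when the
    batch, taken as a whole, changed the set of key values.
    """
    new_counts = apply_batch(counts, changes)
    return new_counts, distinct_keys(counts) != distinct_keys(new_counts)


interest = Counter({"AAPL": 1, "SPY": 1})

# A key added and then removed in the same batch has no overall effect.
_, remap_after_add_remove = needs_remap(interest, [("add", "CMI"), ("remove", "CMI")])

# A duplicate row for an existing key does not change the key set either.
_, remap_after_duplicate = needs_remap(interest, [("add", "AAPL")])

# A new key that survives the batch is an overall change.
_, remap_after_new_key = needs_remap(interest, [("add", "CMI")])
```

Counting occurrences rather than storing a plain set lets the sketch tolerate duplicate key rows, so that removing one of two identical rows is not treated as an overall change.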

A change to the first data source object can include at least one of adding a row to the first data source object, deleting a row from the first data source object, changing the data in a row of the first data source object, and re-indexing the rows of the first data source object.

A change to the second data source object can include at least one of adding a row to the second data source object, deleting a row from the second data source object, changing the data in a row of the second data source object, and re-indexing the rows of the second data source object.

Some implementations can include a method for using a computer system to automatically update data source objects, the method comprising creating a first data source object in memory and mapping the first data source object to a first stored data. The method can also include creating a second data source object in memory and mapping the second data source object to a second stored data. The method can further include creating a third data source object in memory and mapping the third data source object to a first subset of the first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object. The method can include creating a first listener for the third data source object in memory and listening with the first listener for one or more changes to the first data source object. The method can also include making one or more changes to the first data source object. The method can further include detecting by the first listener of one or more changes to the first data source object. The method can include receiving a notification from the first listener of the change to the first data source object and updating the mapping of the third data source object with the one or more changes to the first data source object.

The method can further include creating a second listener for the second data source object in memory and listening with the second listener for one or more changes to the second data source object. The method can include making one or more changes to the second data source object. The method can also include detecting by the second listener of one or more changes to the second data source object. The method can further include receiving a notification of one or more changes to the second data source object and requesting a remapping of the third data source object to a second subset of first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object. The method can also include updating the mapping of the third data source object to a subset of first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object.

In some implementations, the mapping the third data source object to a first subset of the first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object can include selecting a set of rows from the first stored data with one or more key values that are present in the second stored data.

In some implementations, the mapping the third data source object to a first subset of the first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object includes selecting a set of rows from the first stored data with one or more key values that are not present in the second stored data.

The method can further include creating a second listener for the second data source object in memory and listening with the second listener for one or more changes to the second data source object. The method can include making one or more changes to the second data source object. The method can also include detecting by the second listener of one or more changes to the second data source object. The method can further include receiving a notification of one or more changes to the second data source object and determining whether to request a remapping of the third data source object to a second subset of first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object, the determination based on whether the one or more changes to the second data source object effected an overall change in the second data source. The method can include updating the mapping of the third data source object to a subset of first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object only if the one or more changes to the second data source object effected an overall change in the second data source.

A change to the first data source object can include at least one of adding a row to the first data source object, deleting a row from the first data source object, changing the data in a row of the first data source object, and re-indexing the rows of the first data source object.

A change to the second data source object can include at least one of adding a row to the second data source object, deleting a row from the second data source object, changing the data in a row of the second data source object, and re-indexing the rows of the second data source object.

Some implementations can include a nontransitory computer readable medium having stored thereon software instructions that, when executed by one or more processors, cause the one or more processors to perform operations. The operations can include creating a first data source object in memory and mapping the first data source object to a first stored data. The operations can also include creating a second data source object in memory and mapping the second data source object to a second stored data. The operations can further include creating a third data source object in memory and mapping the third data source object to a first subset of the first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object. The operations can include creating a first listener for the third data source object in memory and listening with the first listener for one or more changes to the first data source object. The operations can also include making one or more changes to the first data source object. The operations can further include detecting by the first listener of one or more changes to the first data source object. The operations can include receiving a notification from the first listener of the change to the first data source object and updating the mapping of the third data source object with the one or more changes to the first data source object.

The operations can further include creating a second listener for the second data source object in memory and listening with the second listener for one or more changes to the second data source object. The operations can include making one or more changes to the second data source object. The operations can further include detecting by the second listener of one or more changes to the second data source object. The operations can also include receiving a notification of one or more changes to the second data source object and requesting a remapping of the third data source object to a second subset of first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object. The operations can further include updating the mapping of the third data source object to a subset of first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object.

In some implementations, the mapping the third data source object to a first subset of the first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object can include selecting a set of rows from the first stored data with one or more key values that are present in the second stored data.

The operations can further include creating a second listener for the second data source object in memory and listening with the second listener for one or more changes to the second data source object. The operations can include making one or more changes to the second data source object. The operations can also include detecting by the second listener of one or more changes to the second data source object. The operations can further include receiving a notification of one or more changes to the second data source object and determining whether to request a remapping of the third data source object to a second subset of first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object, the determination based on whether the one or more changes to the second data source object effected an overall change in the second data source. The operations can include updating the mapping of the third data source object to a subset of first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object only if the one or more changes to the second data source object effected an overall change in the second data source.

A change to the first data source object can include at least one of adding a row to the first data source object, deleting a row from the first data source object, changing the data in a row of the first data source object, and re-indexing the rows of the first data source object.

A change to the second data source object includes at least one of adding a row to the second data source object, deleting a row from the second data source object, changing the data in a row of the second data source object, and re-indexing the rows of the second data source object.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a diagram of an example computer data system showing an example data distribution configuration in accordance with some implementations.

FIG. 2 is a diagram of an example computer data system showing an example administration/process control arrangement in accordance with some implementations.

FIG. 3 is a diagram of an example computing device configured for dynamic filter operations processing in accordance with some implementations.

FIG. 4 is a diagram of an example interest table in accordance with some implementations.

FIG. 5 is a diagram of an example data table in accordance with some implementations.

FIG. 6 is a diagram of an example interest filtered data table in accordance with some implementations.

FIG. 7 is a flowchart of an example dynamic filtering operation in accordance with some implementations.

FIG. 8 is a flowchart of an example dynamic filtering operation in accordance with some implementations.

DETAILED DESCRIPTION

Reference is made herein to the Java programming language, Java classes, Java bytecode and the Java Virtual Machine (JVM) for purposes of illustrating example implementations. It will be appreciated that implementations can include other programming languages (e.g., groovy, Scala, R, Go, etc.), other programming language structures as an alternative to or in addition to Java classes (e.g., other language classes, objects, data structures, program units, code portions, script portions, etc.), other types of bytecode, object code and/or executable code, and/or other virtual machines or hardware implemented machines configured to execute a data system query.

FIG. 1 is a diagram of an example computer data system and network 100 showing an example data distribution configuration in accordance with some implementations. In particular, the system 100 includes an application host 102, a periodic data import host 104, a query server host 106, a long-term file server 108, and a user data import host 110. While tables are used as an example data object in the description below, it will be appreciated that the data system described herein can also process other data objects such as mathematical objects (e.g., a singular value decomposition of values in a given range of one or more rows and columns of a table), TableMap objects, etc. A TableMap object provides the ability to lookup a Table by some key. This key represents a unique value (or unique tuple of values) from the columns aggregated on in a byExternal( ) statement execution, for example. A TableMap object can be the result of a byExternal( ) statement executed as part of a query. It will also be appreciated that the configurations shown in FIGS. 1 and 2 are for illustration purposes and in a given implementation each data pool (or data store) may be directly attached or may be managed by a file server.
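The idea of looking up a table by a key formed from the aggregated columns can be illustrated with a short Python sketch. This is a plain dictionary-based illustration under stated assumptions and not the data system's byExternal( ) implementation; the helper name partition_by and the row contents are hypothetical.

```python
from collections import defaultdict


def partition_by(rows, key_columns):
    """Group rows into sub-tables keyed by a tuple of the key column values."""
    table_map = defaultdict(list)
    for row in rows:
        key = tuple(row[column] for column in key_columns)
        table_map[key].append(row)
    return dict(table_map)


# Hypothetical rows; the values are placeholders for illustration.
rows = [
    {"Symbol": "AAPL", "Exchange": "X1", "Quote": 1.0},
    {"Symbol": "SPY", "Exchange": "X1", "Quote": 2.0},
    {"Symbol": "AAPL", "Exchange": "X2", "Quote": 3.0},
]

# Single-column key: each unique Symbol maps to its own list of rows.
by_symbol = partition_by(rows, ["Symbol"])
aapl_rows = by_symbol[("AAPL",)]

# Multi-column key: each unique (Symbol, Exchange) tuple maps to its rows.
by_symbol_exchange = partition_by(rows, ["Symbol", "Exchange"])
```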

The application host 102 can include one or more application processes 112, one or more log files 114 (e.g., sequential, row-oriented log files), one or more data log tailers 116 and a multicast key-value publisher 118. The periodic data import host 104 can include a local table data server 124, direct or remote connection to a periodic table data store 122 (e.g., a column-oriented table data store) and a data import server 120. The query server host 106 can include a multicast key-value subscriber 126, a performance table logger 128, local table data store 130 and one or more remote query processors (132, 134) each accessing one or more respective tables (136, 138). The long-term file server 108 can include a long-term data store 140. The user data import host 110 can include a remote user table server 142 and a user table data store 144. Row-oriented log files and column-oriented table data stores are discussed herein for illustration purposes and are not intended to be limiting. It will be appreciated that log files and/or data stores may be configured in other ways. In general, any data stores discussed herein could be configured in a manner suitable for a contemplated implementation.

In operation, the input data application process 112 can be configured to receive input data from a source (e.g., a securities trading data source), apply schema-specified, generated code to format the logged data as it's being prepared for output to the log file 114 and store the received data in the sequential, row-oriented log file 114 via an optional data logging process. In some implementations, the data logging process can include a daemon, or background process task, that is configured to log raw input data received from the application process 112 to the sequential, row-oriented log files on disk and/or a shared memory queue (e.g., for sending data to the multicast publisher 118). Logging raw input data to log files can additionally serve to provide a backup copy of data that can be used in the event that downstream processing of the input data is halted or interrupted or otherwise becomes unreliable.

A data log tailer 116 can be configured to access the sequential, row-oriented log file(s) 114 to retrieve input data logged by the data logging process. In some implementations, the data log tailer 116 can be configured to perform strict byte reading and transmission (e.g., to the data import server 120). The data import server 120 can be configured to store the input data into one or more corresponding data stores such as the periodic table data store 122 in a column-oriented configuration. The periodic table data store 122 can be used to store data that is being received within a time period (e.g., a minute, an hour, a day, etc.) and which may be later processed and stored in a data store of the long-term file server 108. For example, the periodic table data store 122 can include a plurality of data servers configured to store periodic securities trading data according to one or more characteristics of the data (e.g., a data value such as security symbol, the data source such as a given trading exchange, etc.).

The data import server 120 can be configured to receive and store data into the periodic table data store 122 in such a way as to provide a consistent data presentation to other parts of the system. Providing/ensuring consistent data in this context can include, for example, recording logged data to a disk or memory, ensuring rows presented externally are available for consistent reading (e.g., to help ensure that if the system has part of a record, the system has all of the record without any errors), and preserving the order of records from a given data source. If data is presented to clients, such as a remote query processor (132, 134), then the data may be persisted in some fashion (e.g., written to disk).

The local table data server 124 can be configured to retrieve data stored in the periodic table data store 122 and provide the retrieved data to one or more remote query processors (132, 134) via an optional proxy.

The remote user table server (RUTS) 142 can include a centralized consistent data writer, as well as a data server that provides processors with consistent access to the data that it is responsible for managing. For example, users can provide input to the system by writing table data that is then consumed by query processors.

The remote query processors (132, 134) can use data from the data import server 120, local table data server 124 and/or from the long-term file server 108 to perform queries. The remote query processors (132, 134) can also receive data from the multicast key-value subscriber 126, which receives data from the multicast key-value publisher 118 in the application host 102. The performance table logger 128 can log performance information about each remote query processor and its respective queries into a local table data store 130. Further, the remote query processors can also read data from the RUTS, from local table data written by the performance logger, or from user table data read over NFS.

It will be appreciated that the configuration shown in FIG. 1 is a typical example configuration that may be somewhat idealized for illustration purposes. An actual configuration may include one or more of each server and/or host type. The hosts/servers shown in FIG. 1 (e.g., 102-110, 120, 124 and 142) may each be separate or two or more servers may be combined into one or more combined server systems. Data stores can be local/remote, shared/isolated and/or redundant. Any table data may flow through optional proxies indicated by an asterisk on certain connections to the remote query processors. Also, it will be appreciated that the term “periodic” is being used for illustration purposes and can include, but is not limited to, data that has been received within a given time period (e.g., millisecond, second, minute, hour, day, week, month, year, etc.) and which has not yet been stored to a long-term data store (e.g., 140).

FIG. 2 is a diagram of an example computer data system 200 showing an example administration/process control arrangement in accordance with some implementations. The system 200 includes a production client host 202, a controller host 204, a GUI host or workstation 206, and query server hosts 208 and 210. It will be appreciated that there may be one or more of each of 202-210 in a given implementation.

The production client host 202 can include a batch query application 212 (e.g., a query that is executed from a command line interface or the like) and a real time query data consumer process 214 (e.g., an application that connects to and listens to tables created from the execution of a separate query). The batch query application 212 and the real time query data consumer 214 can connect to a remote query dispatcher 222 and one or more remote query processors (224, 226) within the query server host 1 208.

The controller host 204 can include a persistent query controller 216 configured to connect to a remote query dispatcher 232 and one or more remote query processors 228-230. In some implementations, the persistent query controller 216 can serve as the “primary client” for persistent queries and can request remote query processors from dispatchers, and send instructions to start persistent queries. For example, a user can submit a query to 216, and 216 starts and runs the query every day. In another example, a securities trading strategy could be a persistent query. The persistent query controller can start the trading strategy query every morning before the market open, for instance. It will be appreciated that 216 can work on times other than days. In some implementations, the controller may require its own clients to request that queries be started, stopped, etc. This can be done manually, or by scheduled (e.g., cron) jobs. Some implementations can include “advanced scheduling” (e.g., auto-start/stop/restart, time-based repeat, etc.) within the controller.

The GUI host or workstation 206 can include a user console 218 and a user query application 220. The user console 218 can be configured to connect to the persistent query controller 216. The user query application 220 can be configured to connect to one or more remote query dispatchers (e.g., 232) and one or more remote query processors (228, 230).

FIG. 3 is a diagram of an example computing device 300 in accordance with at least one implementation. The computing device 300 includes one or more processors 302, operating system 304, computer readable medium (e.g., memory) 306 and network interface 308. The memory 306 can include a remote query processor application 310 and a data section 312 (e.g., for storing ASTs, precompiled code, etc.).

In operation, the processor 302 may execute the application 310 stored in the memory 306. The application 310 can include software instructions that, when executed by the processor, cause the processor to perform operations for executing and updating queries and dynamic filter operations in accordance with the present disclosure (e.g., performing one or more of 702-712, 802-822 described below).

The application program 310 can operate in conjunction with the data section 312 and the operating system 304.

Large data systems can be dynamic in nature with continuing streams of data being added by the second or even the microsecond. Users of a large data system may only be interested in a subset of the large data. For example, thousands of stock symbols exist but a user may only desire to follow a few favorites. To that end, a user may keep those favorites in a list that can be routinely updated over time. The user can use the favorites list to filter the large data source to retrieve only the data of interest. The filtering can occur every microsecond, second, minute, hour, day, or longer depending on how quickly the data is being added, deleted, or modified in the large data source. After the initial filtering, only supplemental filtering of the added, deleted, modified, or re-indexed data can be required to keep the user up to date as long as the user does not change the favorites list. If the favorites list changes through a deletion or addition in the list, the complete large data source can be filtered to bring the user's result set up to date. To relieve the system from constantly re-filtering the large data set to keep the user up to date, the system can create listeners to monitor for changes to the favorites list and the large data source. If the listener detects an effective change in the favorites list, the system then knows to re-filter the full large data source, but if the listener only detects changes to the large data source, the system knows to only do supplemental updates to the user's result set.
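The listener arrangement described above can be sketched in Python as follows. This is a minimal single-threaded sketch under stated assumptions, not a definitive implementation: the class names TickingTable and DynamicFilter, the callback signature, and the full_refilters counter are hypothetical, and only row additions and deletions are modeled.

```python
class TickingTable:
    """Minimal in-memory table that notifies listeners of row changes."""

    def __init__(self):
        self.rows = {}        # row id -> row dict
        self.listeners = []   # callables taking (kind, row_id, row)
        self._next_id = 0

    def add(self, row):
        row_id = self._next_id
        self._next_id += 1
        self.rows[row_id] = row
        self._notify("add", row_id, row)
        return row_id

    def remove(self, row_id):
        row = self.rows.pop(row_id)
        self._notify("remove", row_id, row)

    def _notify(self, kind, row_id, row):
        for listener in self.listeners:
            listener(kind, row_id, row)


class DynamicFilter:
    """Filtered view of a data table, driven by an interest table."""

    def __init__(self, data, interest, key):
        self.data, self.interest, self.key = data, interest, key
        self.result = {}
        self.full_refilters = 0   # counts full passes over the data table
        self.keys = self._interest_keys()
        self._refilter()
        data.listeners.append(self._on_data_change)
        interest.listeners.append(self._on_interest_change)

    def _interest_keys(self):
        return {row[self.key] for row in self.interest.rows.values()}

    def _refilter(self):
        # Full filtering of the data table against the current key set.
        self.full_refilters += 1
        self.result = {rid: row for rid, row in self.data.rows.items()
                       if row[self.key] in self.keys}

    def _on_data_change(self, kind, row_id, row):
        # Supplemental update: only the changed row is examined.
        if kind == "add" and row[self.key] in self.keys:
            self.result[row_id] = row
        elif kind == "remove":
            self.result.pop(row_id, None)

    def _on_interest_change(self, kind, row_id, row):
        # Re-filter the full data table only on an effective key change.
        new_keys = self._interest_keys()
        if new_keys != self.keys:
            self.keys = new_keys
            self._refilter()


favorites = TickingTable()
quotes = TickingTable()
favorites.add({"Symbol": "AAPL"})
quotes.add({"Symbol": "AAPL", "Quote": 1.0})
quotes.add({"Symbol": "CMI", "Quote": 2.0})
filtered = DynamicFilter(quotes, favorites, "Symbol")
quotes.add({"Symbol": "AAPL", "Quote": 3.0})   # supplemental update only
favorites.add({"Symbol": "CMI"})               # triggers a full re-filter
```

In this sketch a change to the data table touches only the affected row of the result, while a change to the favorites list triggers a full pass only when the set of key values actually differs.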

FIG. 4 is a diagram of an example of an interest data source that can be an interest table (user's favorites list) 400 in accordance with some implementations. The interest table 400 can contain one or more rows of data. The one or more rows of data in an interest table 400 can be used to provide filter parameters for filtering another data source. For example, the interest table 400 can contain interest data such as a stock symbol column 402 that can contain stock symbols (AAPL, SPY) that are of interest for filtering a larger data source that can contain additional information about the stock symbols AAPL and SPY.

It will be appreciated that an interest data source can be stored in forms and formats other than a table, such as a table object, a flat file, an array, or the like. It will also be appreciated that interest data is not limited to a single column or field. For example, the interest data could occupy one or more columns or fields that contain key values for filtering such as Symbol or Symbol and Price.

FIG. 5 is a diagram of an example data source that can be a quotes received table 500 in accordance with some implementations. The data source can contain any selection of data. For example, a data source can contain stock symbols 502, the associated quote date 504, associated quote time 506, and the associated quote 508 that occurred on the quote date 504 and at the quote time 506.

It will be appreciated that the data source can be stored in forms and formats other than a table, such as a table object, a flat file, an array, or the like. It will also be appreciated that the data source is not limited to a particular number of columns or fields. For example, the data source could expand to as many columns or fields that can be supported by the data source system.

FIG. 6 is a diagram of an example filtered data source that can be a filtered quotes table 600 in accordance with some implementations. The filtered quotes table 600 can be the result of the quotes received table 500 filtered by stock symbol 402 of the interest table 400. For example, quotes received table 500 with stock symbol 502 can contain quotes received over time for stock symbols AAPL, CMI, and SPY. The interest table 400 can contain stock symbol 402 that can contain AAPL and SPY. If quotes received table 500 is filtered by selecting only the rows from the quotes received table 500 whose stock symbol 502 matches a symbol in stock symbol 402 of the interest table 400, the resulting table can be the filtered quotes table 600 that only contains rows with stock symbol 602 that match contents of stock symbol 402.
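
The relationship among FIGS. 4, 5, and 6 can be illustrated with a minimal Python sketch. The sketch is not part of any described implementation; the function name filter_by_interest, the list-of-dictionaries row layout, and the quote values are assumptions made only for illustration.

```python
# Hypothetical stand-ins for the quotes received table 500 and the interest
# table 400. The quote values are made up for illustration.
quotes_received = [
    {"Stock_Symbol": "AAPL", "Quote": 100.0},
    {"Stock_Symbol": "CMI", "Quote": 50.0},
    {"Stock_Symbol": "SPY", "Quote": 200.0},
    {"Stock_Symbol": "AAPL", "Quote": 101.0},
]
interest = [{"Stock_Symbol": "AAPL"}, {"Stock_Symbol": "SPY"}]


def filter_by_interest(data_rows, interest_rows, column):
    """Keep only the data rows whose value in `column` appears in the interest rows."""
    keys = {row[column] for row in interest_rows}
    return [row for row in data_rows if row[column] in keys]


# Stand-in for the filtered quotes table 600: the CMI row is excluded.
filtered_quotes = filter_by_interest(quotes_received, interest, "Stock_Symbol")
```

In this sketch the filter is computed once; keeping the result current as either input changes is the subject of the dynamic filtering operation described with respect to FIGS. 7 and 8.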

It will be appreciated that more than one column from an interest table can be used to filter a data source.

It will also be appreciated that a variety of filtering logic can be used in conjunction with the interest table. Selection based on values found in the interest table and selection based on values not found in the interest table are two examples. Other examples include, but are not limited to, applying one or more formulas, or less than or equal to and/or greater than or equal to qualifiers.
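
As a hedged illustration of multi-column keys and of the found-in and not-found-in selections, the following Python sketch builds a key from one or more columns and keeps either the matching or the non-matching rows. The names key_of, select_rows, and keep_matches are hypothetical, as are the sample Symbol and Price values.

```python
def key_of(row, columns):
    """Build a hashable key from one or more columns of a row."""
    return tuple(row[c] for c in columns)


def select_rows(data_rows, interest_rows, columns, keep_matches=True):
    """Keep rows whose key is found in (or, if keep_matches is False,
    not found in) the interest rows."""
    keys = {key_of(r, columns) for r in interest_rows}
    return [r for r in data_rows if (key_of(r, columns) in keys) == keep_matches]


# Made-up sample rows for illustration only.
data = [
    {"Symbol": "AAPL", "Price": 100},
    {"Symbol": "AAPL", "Price": 101},
    {"Symbol": "CMI", "Price": 100},
]
interest = [{"Symbol": "AAPL", "Price": 100}]

by_symbol = select_rows(data, interest, ["Symbol"])
by_symbol_and_price = select_rows(data, interest, ["Symbol", "Price"])
not_in_interest = select_rows(data, interest, ["Symbol"], keep_matches=False)
```

Formula-based or range-based qualifiers could be sketched the same way by replacing the set membership test with a predicate; that variation is not shown here.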

FIG. 7 is a flowchart of an example flow of a dynamic filtering operation 700 using an interest data source and a data source in accordance with some implementations. The components of the example dynamic filtering operation 700 can be a ticking table A 702, a ticking table B 704, a table A modification listener 706, a table B modification listener 710, a filtered results table C 708, and a request to perform a full filtering of table A 712 to update table C 708.

Ticking tables such as table A 702 and table B 704 can be data sources that are changing frequently or that can change. For example, changes can occur due to an addition of one or more rows, a modification to one or more existing rows, deletion of one or more rows, or re-indexing. Re-indexing can be the same data but with different row locations. An example of table A can be the quotes received table 500. An example of table B can be the interest table 400.

It will be appreciated that changes that can occur to data sources are not limited to an addition of one or more rows, a modification to one or more existing rows, deletion of one or more rows, or re-indexing. For example, changes such as column additions, column deletions, column merges, row merges, or the like can occur.

It will be appreciated that table A 702 and table B 704 can change asynchronously. For example, table A 702 can be a table that adds new rows every microsecond, second, minute, hour, day or the like. Table B 704 can be a table that never or rarely adds, modifies, or deletes rows. The changes to table A 702 can be made independent of changes to table B 704 and the changes to table B 704 can be made independent of the changes to table A 702.

Table modification listeners such as table A modification listener 706 and table B modification listener 710 can be a software construct associated with a changing data source that can listen for events or changes that can occur in a changing data source. Examples of events or changes can include an addition of one or more rows to a table, a modification of one or more rows of a table, a deletion of one or more rows of a table, or a re-indexing of the rows of a table. A modification listener (706, 710) can trigger filtering to occur after an event or change is detected by the modification listener (706, 710).
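
One possible shape for such a listener construct is sketched below in Python. The TickingTable class, its method names, and the (kind, key, row) callback signature are assumptions chosen for illustration and are not taken from any described implementation.

```python
class TickingTable:
    """A hypothetical changing data source that notifies registered listeners."""

    def __init__(self):
        self.rows = {}        # row key -> row dictionary
        self._listeners = []  # callbacks invoked as callback(kind, key, row)
        self._next_key = 0

    def add_listener(self, callback):
        self._listeners.append(callback)

    def _notify(self, kind, key, row):
        for callback in list(self._listeners):
            callback(kind, key, row)

    def add_row(self, row):
        key = self._next_key
        self._next_key += 1
        self.rows[key] = dict(row)
        self._notify("added", key, self.rows[key])
        return key

    def modify_row(self, key, row):
        self.rows[key] = dict(row)
        self._notify("modified", key, self.rows[key])

    def delete_row(self, key):
        row = self.rows.pop(key)
        self._notify("deleted", key, row)


# Usage: record every event seen by a listener.
table = TickingTable()
events = []
table.add_listener(lambda kind, key, row: events.append((kind, key, row["Stock_Symbol"])))
k = table.add_row({"Stock_Symbol": "AAPL"})
table.modify_row(k, {"Stock_Symbol": "SPY"})
table.delete_row(k)
```

Re-indexing notifications are omitted from this sketch for brevity; they could be delivered through the same callback with an additional event kind.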

Filtered data source results such as table C 708 can be a filtered result of table A. An example of table C can be the filtered quotes table 600. Table C can be formed by an example command such as table_C=table_A.DynamicFilteringOperation(table_B, “interest column”). The DynamicFilteringOperation portion of the command can alert a compiler or an interpreter that the filter will remain dynamic through the life of table C. The table_B portion of the command can alert a compiler or an interpreter that table B will provide the filtering by designation of the table B filtering column or columns, “interest column.”

It will be appreciated that an example command such as table_C=table_A.NotInDynamicFilteringOperation(table_B, “interest column”) can create a resultant table C that does not contain rows that contain any of the items designated in the table B interest column.

It will also be appreciated that a formula or formulas can be substituted for interest column or columns.

FIG. 7 demonstrates example flow possibilities for updating a table C that has already been created from the filtering of table A at least once with a table B through the application of a DynamicFilteringOperation command in accordance with some implementations. As part of the application of the DynamicFilteringOperation command, listeners 706, 710 for input to table A 702 and table B 704 can be configured to listen for any changes to table A 702 and table B 704 respectively, in order to determine when, where, and how to apply the dynamic filter operation.

Listener 706 can continuously listen for changes to table A 702. If the listener 706 detects a change to table A 702 through either an addition of one or more rows, a deletion of one or more rows, a modification of one or more rows, or a re-indexing of table A 702, the listener 706 can trigger a re-filtering of table C 708 for only those rows affected by the addition, deletion, modification or re-indexing.
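
A minimal Python sketch of re-filtering only the affected rows follows. The function apply_data_changes and its parameters are hypothetical; the sketch assumes the listener delivers added and modified rows as (row key, row) pairs and deleted rows as row keys, and it does not cover re-indexing.

```python
def apply_data_changes(result, interest_keys, column,
                       added=(), modified=(), deleted=()):
    """Update `result` (a stand-in for table C) in place using only the
    changed rows of the data source (a stand-in for table A)."""
    for row_key in deleted:
        result.pop(row_key, None)
    for row_key, row in list(added) + list(modified):
        if row[column] in interest_keys:
            result[row_key] = row
        else:
            # A modified row may no longer match the interest set.
            result.pop(row_key, None)


interest_keys = {"AAPL", "SPY"}
result = {}

apply_data_changes(result, interest_keys, "Stock_Symbol",
                   added=[(0, {"Stock_Symbol": "AAPL"}), (1, {"Stock_Symbol": "CMI"})])
after_add = dict(result)

apply_data_changes(result, interest_keys, "Stock_Symbol",
                   modified=[(0, {"Stock_Symbol": "CMI"}), (1, {"Stock_Symbol": "SPY"})])
after_modify = dict(result)

apply_data_changes(result, interest_keys, "Stock_Symbol", deleted=[1])
after_delete = dict(result)
```

The work done per update in this sketch is proportional to the number of changed rows rather than to the size of table A.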

It will be appreciated that changes that can be detected by the listener are not limited to an addition of one or more rows, a modification to one or more existing rows, deletion of one or more rows, or re-indexing. For example, changes such as column additions, column deletions, column merges, row merges, or the like can be detected.

Listener 710 can continuously listen for messages containing changes to table B 704. If listener 710 does not detect a message regarding an addition of one or more rows, a deletion of one or more rows, a modification of one or more rows, a re-indexing, or other message types in table B 704, listener 710 does not take any action toward re-filtering table C 708. If listener 710 detects an addition of one or more rows, a deletion of one or more rows, a modification of one or more rows, a re-indexing, or other message types in table B 704, listener 710 can initiate a request for full table filtering 712 of table A 702, which causes table C 708 to be updated to reflect the new interest set in table B 704. The updated table C 708 can then send a notification message of the changes to any downstream listeners for children created from operations on table C. This can be an equivalent replacement of table C without the table C object being deleted and recreated. The listener 710 can also maintain additional state to prevent re-filtering when modifications to table B 704 do not result in a new interest set, for example, when adding and removing rows with duplicate values.
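
The additional state mentioned above could, for example, be a count of how many interest rows hold each value, so that a full re-filter is requested only when a value enters or leaves the interest set. The following Python sketch assumes that approach; the DynamicFilter class, its callback names, and the full_refilters counter are hypothetical and are not the described implementation.

```python
from collections import Counter


class DynamicFilter:
    """Hypothetical sketch: keeps a result (table C) in step with a data
    source (table A) and an interest source (table B)."""

    def __init__(self, column):
        self.column = column
        self.data = {}                    # table A: row key -> row
        self.interest_counts = Counter()  # table B: value -> number of rows holding it
        self.result = {}                  # table C: row key -> row
        self.full_refilters = 0           # how many full re-filters were requested

    # Callbacks a table A listener might invoke.
    def on_data_added(self, key, row):
        self.data[key] = row
        if self.interest_counts[row[self.column]] > 0:
            self.result[key] = row

    def on_data_deleted(self, key):
        self.data.pop(key, None)
        self.result.pop(key, None)

    # Callbacks a table B listener might invoke.
    def on_interest_added(self, value):
        self.interest_counts[value] += 1
        if self.interest_counts[value] == 1:  # the interest set actually grew
            self._full_refilter()

    def on_interest_removed(self, value):
        if self.interest_counts[value] == 0:
            return
        self.interest_counts[value] -= 1
        if self.interest_counts[value] == 0:  # the interest set actually shrank
            self._full_refilter()

    def _full_refilter(self):
        self.full_refilters += 1
        matching = {k: r for k, r in self.data.items()
                    if self.interest_counts[r[self.column]] > 0}
        # Replace the contents in place so the result object itself survives.
        self.result.clear()
        self.result.update(matching)


f = DynamicFilter("Stock_Symbol")
f.on_interest_added("AAPL")                    # new value: full re-filter
f.on_interest_added("SPY")                     # new value: full re-filter
f.on_data_added(0, {"Stock_Symbol": "AAPL"})   # incremental: row kept
f.on_data_added(1, {"Stock_Symbol": "CMI"})    # incremental: row skipped
f.on_interest_added("AAPL")                    # duplicate value: no re-filter
f.on_interest_removed("AAPL")                  # one AAPL row remains: no re-filter
f.on_interest_added("CMI")                     # new value: full re-filter
```

Notification of downstream listeners is left out of this sketch; it could be added at the end of each callback that changes the result.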

It will be appreciated that filtering on only changed table A 702 rows and only completing a full filtering of table A 702 when table B 704 changes can provide a significant system efficiency savings for large tables or large data sources.

It will be appreciated that a DynamicFilteringOperation can be implemented with constructs other than listeners, such as any construct that can monitor events such as an addition of one or more rows, a deletion of one or more rows, a modification of one or more rows, or re-indexing in a table or other data source.

It will also be appreciated that a DynamicFilteringOperation can be executed in a remote query processor application 310 but is not limited to being executed in a remote query processor application.

FIG. 8 is a diagram of an example dynamic filtering operation 800 using the example tables from FIGS. 4, 5, and 6 in accordance with some implementations. Processing begins at 802, when a quotes received table 500 is created and populated with data. Alternatively, processing can begin at 804 with the creation and populating of an interest table 400. The quotes received table 500 and the interest table 400 can also be created and populated simultaneously. Processing continues to 806 and 808.

It will be appreciated that a dynamic filtering operation can be executed in a remote query processor application 310 but is not limited to being executed in a remote query processor application.

At 806, a listener is created to detect changes to the quotes received table 500. Changes that can occur to the quotes received table 500 include an addition of rows, a deletion of rows, a modification of row content, or a re-indexing of rows.

It will be appreciated that changes that can be detected by the listener are not limited to an addition of one or more rows, a modification to one or more existing rows, deletion of one or more rows, or re-indexing. For example, changes such as column additions, column deletions, column merges, row merges, or the like can be detected.

At 808, a listener is created to detect messages containing changes to the interest table 400. Examples of messages of changes that can occur to the interest table 400 include an addition of rows, a deletion of rows, a modification of row content, a re-indexing, or other message types. It will be appreciated that the creation of the listener 806, 808, follows the creation of the associated table, respectively quotes received table 500 and interest table 400. Accordingly, whether listener 806 precedes the creation of listener 808 or whether listener 808 precedes the creation of listener 806 or whether listener 808 and listener 806 are created simultaneously depends on the timing of the creation of the quotes received table 500 and the interest table 400. Processing continues to 810.

At 810, the filtered quotes table 600 can be created by executing the following example dynamic filtering operation command: Filtered_Quotes_Table=Quotes_Received_Table.WhereDynamicIn(Interest_Table, “Stock_Symbol”). The execution of the dynamic filtering operation command also configures the listeners (806, 808) to trigger an update to the filtered quotes table 600 for a change detected to the quotes received table 500 and to trigger a full filtering of the quotes received table 500 causing a full update of the filtered quotes table 600 for a change detected to the interest table 400. Processing continues to 812.

It will be appreciated that 812 and 818 and their connected next steps can be run in parallel or asynchronously. For clarity of process, steps 812 through 816 are addressed first before returning to 818.

At 812, the listener detects whether one or more rows have been added, modified, deleted or re-indexed in the quotes received table 500. Processing continues to 814.

At 814, when the listener detects the addition, modification, deletion or other change, the listener triggers the execution of the dynamic filtering command on only the added, modified, deleted, or changed portion of the quotes received table 500. Processing continues to 816.

At 816, the filtered quotes table 600 is updated with only the changes made to the quotes received table 500. For example, if a new row for AAPL has been added to the quotes received table 500, then the dynamic filter is executed on that row. The filtered quotes table 600 is updated with the new AAPL row because AAPL is also found in the interest table 400. In another example, if a new row for CMI has been added to the quotes received table 500, then the dynamic filter is executed on that row. But the filtered quotes table 600 is not updated with the new CMI row because CMI is not found in the interest table 400. Process returns to 812.

At 812, the process from 812 to 816 will continue to loop as long as the dynamic filter command remains active. The discussion of the flowchart continues at 818.

At 818, the listener created at 808 and configured at 810 listens for the addition of one or more rows, the modification of one or more rows, the deletion of one or more rows, or other changes to the interest table 400. Processing continues to 820.

At 820, if the listener detects the addition, modification, deletion or other change that can result in a change to the interest set, the listener triggers the execution of the dynamic filtering operation command on the entirety of the quotes received table 500. For example, if CMI is added to the interest table, then the entire quotes received table 500 will be filtered on AAPL, CMI, and SPY to pick up all the CMI rows that were not previously part of the filtered quotes table 600.

It will be appreciated that in some cases, the system may not need to apply the change to the entirety of the table, thus avoiding the need to re-compute the entirety of the filter operation. For example, if the interest table only had one row removed, the system can update only the removed element rather than re-compute the whole table. Processing continues to 822.
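
The single-row removal case can be sketched in Python as follows. The function drop_removed_interest is hypothetical, and the sketch assumes the removed value no longer appears anywhere in the interest table, so that only rows already in the filtered result need to be examined.

```python
def drop_removed_interest(result, column, removed_value):
    """Remove from `result` only the rows that matched the removed interest
    value, without re-filtering the full data source. Returns the dropped keys."""
    stale = [key for key, row in result.items() if row[column] == removed_value]
    for key in stale:
        del result[key]
    return stale


# Stand-in for the filtered quotes table 600 before AAPL is removed from the
# interest table 400.
result = {
    0: {"Stock_Symbol": "AAPL"},
    1: {"Stock_Symbol": "SPY"},
    2: {"Stock_Symbol": "AAPL"},
}
dropped = drop_removed_interest(result, "Stock_Symbol", "AAPL")
```

Because every row that could be affected by a removal is already in the filtered result, this path never needs to read the quotes received table 500.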

At 822, the filtered quotes table is updated by applying the dynamic filtering to the entirety of the quotes received table. Process returns to 818.

At 818, the process from 818 to 822 will continue to loop as long as the dynamic filtering operation command remains active.

It will be appreciated that the modules, processes, systems, and sections described above can be implemented in hardware, hardware programmed by software, software instructions stored on a nontransitory computer readable medium or a combination of the above. A system as described above, for example, can include a processor configured to execute a sequence of programmed instructions stored on a nontransitory computer readable medium. For example, the processor can include, but not be limited to, a personal computer or workstation or other such computing system that includes a processor, microprocessor, microcontroller device, or is comprised of control logic including integrated circuits such as, for example, an Application Specific Integrated Circuit (ASIC), a field programmable gate array (FPGA), graphics processing unit (GPU), or the like. The instructions can be compiled from source code instructions provided in accordance with a programming language such as Java, C, C++, C#.net, assembly or the like. The instructions can also comprise code and data objects provided in accordance with, for example, the Visual Basic™ language, a specialized database query language, or another structured or object-oriented programming language. The sequence of programmed instructions, or programmable logic device configuration software, and data associated therewith can be stored in a nontransitory computer-readable medium such as a computer memory or storage device which may be any suitable memory apparatus, such as, but not limited to ROM, PROM, EEPROM, RAM, flash memory, disk drive and the like.

Furthermore, the modules, processes, systems, and sections can be implemented as a single processor or as a distributed processor. Further, it should be appreciated that the steps mentioned above may be performed on a single or distributed processor (single and/or multi-core, or cloud computing system). Also, the processes, system components, modules, and sub-modules described in the various figures of and for embodiments above may be distributed across multiple computers or systems or may be co-located in a single processor or system. Example structural embodiment alternatives suitable for implementing the modules, sections, systems, means, or processes described herein are provided below.

The modules, processors or systems described above can be implemented as a programmed general purpose computer, an electronic device programmed with microcode, a hard-wired analog logic circuit, software stored on a computer-readable medium or signal, an optical computing device, a networked system of electronic and/or optical devices, a special purpose computing device, an integrated circuit device, a semiconductor chip, and/or a software module or object stored on a computer-readable medium or signal, for example.

Embodiments of the method and system (or their sub-components or modules), may be implemented on a general-purpose computer, a special-purpose computer, a programmed microprocessor or microcontroller and peripheral integrated circuit element, an ASIC or other integrated circuit, a digital signal processor, a hardwired electronic or logic circuit such as a discrete element circuit, a programmed logic circuit such as a PLD, PLA, FPGA, PAL, or the like. In general, any processor capable of implementing the functions or steps described herein can be used to implement embodiments of the method, system, or a computer program product (software program stored on a nontransitory computer readable medium).

Furthermore, embodiments of the disclosed method, system, and computer program product (or software instructions stored on a nontransitory computer readable medium) may be readily implemented, fully or partially, in software using, for example, object or object-oriented software development environments that provide portable source code that can be used on a variety of computer platforms. Alternatively, embodiments of the disclosed method, system, and computer program product can be implemented partially or fully in hardware using, for example, standard logic circuits or a VLSI design. Other hardware or software can be used to implement embodiments depending on the speed and/or efficiency requirements of the systems, the particular function, and/or particular software or hardware system, microprocessor, or microcomputer being utilized. Embodiments of the method, system, and computer program product can be implemented in hardware and/or software using any known or later developed systems or structures, devices and/or software by those of ordinary skill in the applicable art from the function description provided herein and with a general basic knowledge of the software engineering and computer networking arts.

Moreover, embodiments of the disclosed method, system, and computer readable media (or computer program product) can be implemented in software executed on a programmed general purpose computer, a special purpose computer, a microprocessor, or the like.

It is, therefore, apparent that there is provided, in accordance with the various embodiments disclosed herein, methods, systems and computer readable media for dynamic filter operations.

Application Ser. No. 15/154,974, entitled “DATA PARTITIONING AND ORDERING” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,975, entitled “COMPUTER DATA SYSTEM DATA SOURCE REFRESHING USING AN UPDATE PROPAGATION GRAPH” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,979, entitled “COMPUTER DATA SYSTEM POSITION-INDEX MAPPING” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,980, entitled “SYSTEM PERFORMANCE LOGGING OF COMPLEX REMOTE QUERY PROCESSOR QUERY OPERATIONS” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,983, entitled “DISTRIBUTED AND OPTIMIZED GARBAGE COLLECTION OF REMOTE AND EXPORTED TABLE HANDLE LINKS TO UPDATE PROPAGATION GRAPH NODES” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,984, entitled “COMPUTER DATA SYSTEM CURRENT ROW POSITION QUERY LANGUAGE CONSTRUCT AND ARRAY PROCESSING QUERY LANGUAGE CONSTRUCTS” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,985, entitled “PARSING AND COMPILING DATA SYSTEM QUERIES” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,987, entitled “DYNAMIC FILTER PROCESSING” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,988, entitled “DYNAMIC JOIN PROCESSING USING REAL-TIME MERGED NOTIFICATION LISTENER” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,990, entitled “DYNAMIC TABLE INDEX MAPPING” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,991, entitled “QUERY TASK PROCESSING BASED ON MEMORY ALLOCATION AND PERFORMANCE CRITERIA” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,993, entitled “A MEMORY-EFFICIENT COMPUTER SYSTEM FOR DYNAMIC UPDATING OF JOIN PROCESSING” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,995, entitled “QUERY DISPATCH AND EXECUTION ARCHITECTURE” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,996, entitled “COMPUTER DATA DISTRIBUTION ARCHITECTURE” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,997, entitled “DYNAMIC UPDATING OF QUERY RESULT DISPLAYS” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,998, entitled “DYNAMIC CODE LOADING” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/154,999, entitled “IMPORTATION, PRESENTATION, AND PERSISTENT STORAGE OF DATA” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/155,001, entitled “COMPUTER DATA DISTRIBUTION ARCHITECTURE” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/155,005, entitled “PERSISTENT QUERY DISPATCH AND EXECUTION ARCHITECTURE” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/155,006, entitled “SINGLE INPUT GRAPHICAL USER INTERFACE CONTROL ELEMENT AND METHOD” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/155,007, entitled “GRAPHICAL USER INTERFACE DISPLAY EFFECTS FOR A COMPUTER DISPLAY SCREEN” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/155,009, entitled “COMPUTER ASSISTED COMPLETION OF HYPERLINK COMMAND SEGMENTS” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/155,010, entitled “HISTORICAL DATA REPLAY UTILIZING A COMPUTER SYSTEM” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/155,011, entitled “DATA STORE ACCESS PERMISSION SYSTEM WITH INTERLEAVED APPLICATION OF DEFERRED ACCESS CONTROL FILTERS” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

Application Ser. No. 15/155,012, entitled “REMOTE DATA OBJECT PUBLISHING/SUBSCRIBING SYSTEM HAVING A MULTICAST KEY-VALUE PROTOCOL” and filed in the United States Patent and Trademark Office on May 14, 2016, is hereby incorporated by reference herein in its entirety as if fully set forth herein.

While the disclosed subject matter has been described in conjunction with a number of embodiments, it is evident that many alternatives, modifications and variations would be, or are, apparent to those of ordinary skill in the applicable arts. Accordingly, Applicants intend to embrace all such alternatives, modifications, equivalents and variations that are within the spirit and scope of the disclosed subject matter. 

What is claimed is:
 1. A system for automatically updating data source objects, the system comprising: one or more hardware processors; a computer readable data storage device coupled to the one or more hardware processors, the computer readable data storage device having stored thereon software instructions that, when executed by the one or more hardware processors, cause the one or more hardware processors to perform operations including: creating a first data source object in a first memory; mapping the first data source object to a first stored data; creating a second data source object in a second memory, the second data source object different than the first data source object; mapping the second data source object to a second stored data; creating a third data source object in a third memory, the third data source object different than the first data source object and the second data source object; mapping the third data source object to a first subset of the first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object; receiving a notification of one or more changes to the second data source object; and requesting a remapping of the third data source object to a second subset of first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object.
 2. The system of claim 1, the operations further comprising: updating the mapping of the third data source object to a subset of first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object.
 3. The system of claim 1, wherein the mapping the third data source object to a first subset of the first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object includes selecting a set of rows from the first stored data with one or more key values that are present in the second stored data.
 4. The system of claim 1, wherein the mapping the third data source object to a first subset of the first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object includes selecting a set of rows from the first stored data with one or more key values that are not present in the second stored data.
 5. The system of claim 2, the operations further comprising: determining whether to request a remapping of the third data source object to a second subset of first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object, the determination based on whether the one or more changes to the second data source object effected an overall change in the second data source, wherein the updating the mapping of the third data source object to a subset of first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object is performed only when the one or more changes to the second data source object effected an overall change in the second data source.
 6. The system of claim 1, wherein the change to the first data source object includes at least re-indexing the rows of the first data source object.
 7. The system of claim 1, wherein a change to the second data source object includes at least one of: adding a row to the second data source object; deleting a row from the second data source object; changing the data in a row of the second data source object; and re-indexing the rows of the second data source object.
 8. The system of claim 1, wherein the first memory, the second memory, and the third memory are all different.
 9. The system of claim 1, the operations further comprising: sending a notification message of changes to the third data source object to any downstream listeners of one or more children created from operations on the third data source object.
 10. The system of claim 1, wherein the remapping includes remapping of the third data source object to a second subset of first stored data by full data filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object; and wherein the first data source object is different than the second data source object.
 11. A method for using a computer system to automatically update data source objects, the method comprising: mapping a first data source object to a first stored data; mapping a second data source object to a second stored data; mapping a third data source object to a first subset of the first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object; receiving a notification of one or more changes to the second data source object; and requesting a remapping of the third data source object to a second subset of first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object.
 12. The method of claim 11, further comprising: updating the mapping of the third data source object to a subset of first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object.
 13. The method of claim 11, wherein the mapping the third data source object to a first subset of the first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object includes selecting a set of rows from the first stored data with one or more key values that are present in the second stored data.
 14. The method of claim 11, wherein the mapping the third data source object to a first subset of the first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object includes selecting a set of rows from the first stored data with one or more key values that are not present in the second stored data.
 15. The method of claim 12, further comprising: determining whether to request a remapping of the third data source object to a second subset of first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object, the determination based on whether the one or more changes to the second data source object effected an overall change in the second data source, wherein the updating the mapping of the third data source object to a subset of first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object is performed only when the one or more changes to the second data source object effected an overall change in the second data source.
 16. The method of claim 11, wherein a change to the first data source object includes at least re-indexing the rows of the first data source object.
 17. The method of claim 11, wherein a change to the second data source object includes at least one of: adding a row to the second data source object; deleting a row from the second data source object; changing the data in a row of the second data source object; and re-indexing the rows of the second data source object.
 18. The method of claim 11, wherein the first data source object, the second data source object, and the third data source object are all stored in different memory devices.
 19. The method of claim 11, further comprising: sending a notification message of changes to the third data source object to any downstream listeners of one or more children created from operations on the third data source object.
 20. The method of claim 11, wherein the remapping includes remapping of the third data source object to a second subset of first stored data by full data filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object; and wherein the first data source object is different than the second data source object.
 21. A non-transitory computer readable medium having stored thereon software instructions that, when executed by one or more processors, cause the one or more processors to perform operations including: mapping a first data source object to a first stored data; mapping a second data source object to a second stored data; mapping a third data source object to a first subset of the first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object; receiving a notification of one or more changes to the second data source object; and requesting a remapping of the third data source object to a second subset of first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object.
 22. The non-transitory computer readable medium of claim 21, the operations further comprising: updating the mapping of the third data source object to a subset of first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object.
 23. The non-transitory computer readable medium of claim 21, wherein mapping the third data source object to a first subset of the first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object includes selecting a set of rows from the first stored data with one or more key values that are present in the second stored data.
 24. The non-transitory computer readable medium of claim 22, the operations further comprising: determining whether to request a remapping of the third data source object to a second subset of first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object, the determination based on whether the one or more changes to the second data source object effected an overall change in the second data source, wherein the updating the mapping of the third data source object to a subset of first stored data by filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object is performed only when the one or more changes to the second data source object effected an overall change in the second data source.
 25. The non-transitory computer readable medium of claim 21, wherein a change to the first data source object includes at least re-indexing the rows of the first data source object.
 26. The non-transitory computer readable medium of claim 21, wherein a change to the second data source object includes at least one of: adding a row to the second data source object; deleting a row from the second data source object; changing the data in a row of the second data source object; and re-indexing the rows of the second data source object.
 27. The non-transitory computer readable medium of claim 21, wherein the first data source object, the second data source object, and the third data source object are all stored in different memory devices.
 28. The non-transitory computer readable medium of claim 21, the operations further comprising: sending a notification message of changes to the third data source object to any downstream listeners of one or more children created from operations on the third data source object.
 29. The non-transitory computer readable medium of claim 21, wherein the remapping includes remapping of the third data source object to a second subset of first stored data by full data filtering the first stored data mapped to the first data source object with the second stored data mapped to the second data source object; and wherein the first data source object is different than the second data source object.
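
By way of a non-limiting illustration, the sequence recited in the method claims above (mapping a third data source object to a filtered subset, receiving a notification of a change to the second data source object, determining whether the change effected an overall change, remapping, and notifying downstream listeners) can be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the disclosed implementation: the names Table, DynamicFilter, listen, notify, add_row, remove_rows, invert, and remap_count are hypothetical, rows are assumed to be in-memory dictionaries, and "overall change" is assumed to mean a change in the set of key values present in the second data source object.

```python
from typing import Callable, Dict, FrozenSet, List

Row = Dict[str, object]


class Table:
    """Hypothetical in-memory data source object that can notify listeners."""

    def __init__(self, rows: List[Row]) -> None:
        self.rows: List[Row] = list(rows)
        self._listeners: List[Callable[[], None]] = []

    def listen(self, callback: Callable[[], None]) -> None:
        """Register a listener that is called after each change."""
        self._listeners.append(callback)

    def notify(self) -> None:
        for callback in list(self._listeners):
            callback()

    def add_row(self, row: Row) -> None:
        self.rows.append(row)
        self.notify()

    def remove_rows(self, key: str, value: object) -> None:
        self.rows = [r for r in self.rows if r[key] != value]
        self.notify()


class DynamicFilter(Table):
    """Hypothetical third data source object: rows of `data` filtered by the
    key values currently present in `criteria` (or absent, if invert=True)."""

    def __init__(self, data: Table, criteria: Table, key: str,
                 invert: bool = False) -> None:
        super().__init__([])
        self.data = data
        self.criteria = criteria
        self.key = key
        self.invert = invert
        self.remap_count = 0  # illustration only: counts full re-filters
        self._keys: FrozenSet[object] = self._current_keys()
        self._remap()
        # Listen for changes to both parents.
        criteria.listen(self._on_criteria_change)
        data.listen(self._remap)

    def _current_keys(self) -> FrozenSet[object]:
        return frozenset(r[self.key] for r in self.criteria.rows)

    def _on_criteria_change(self) -> None:
        keys = self._current_keys()
        if keys == self._keys:
            return  # assumed meaning of "no overall change": same key set
        self._keys = keys
        self._remap()

    def _remap(self) -> None:
        # Full data filtering of the first table against the current key set.
        self.rows = [r for r in self.data.rows
                     if (r[self.key] in self._keys) != self.invert]
        self.remap_count += 1
        self.notify()  # propagate to any downstream listeners
```

In this sketch, adding a row to the criteria table whose key value is already present leaves the key set unchanged, so no remapping is requested; adding or removing a distinct key value triggers a full re-filter of the data table and a notification to any listeners registered on the filtered table. Passing invert=True selects rows whose key values are not present in the criteria table.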